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Abstract — For most wireless services with variable rate trans- 
mission, both average rate and rate oscillation are important 
performance metrics. The traditional performance criterion, 
utility of average transmission rate, boosts the average rate 
but also results in high rate oscillations. We introduce a utility 
function of instantaneous transmission rates. It is capable of 
facilitating the resource allocation with flexible combinations 
of average rate and rate oscillation. Based on the new utility, 
we consider the time and power allocation in a time-shared 
wireless network. Two adaptation policies are developed, namely, 
time sharing (TS) and joint time sharing and power control 
(JTPC). An extension to quantized time sharing with limited 
channel feedback (QTSL) for practical systems is also discussed. 
Simulation results show that by controlling the concavity of the 
utility function, a tradeoff between the average rate and rate 
oscillation can be easily made. 

Index Terms — Utility function, time-sharing, power control, 
rate adaptive, fairness. 



I. Introduction 

An important aspect of wireless systems is dynamic channel 
characteristics. One promising approach for addressing this 
issue is to dynamically allocate limited resources based on 
channel information and system preferences. Traditional inves- 
tigations on wireless resource allocation pay much attention 
to hard real-time services. Therein, the goal is to smooth 
out channel variation and build "bit pipes" that deliver data 
at a fixed rate. The rapid growth of the Internet has led to 
an increasing demand for supporting transmissions of best- 
effort service in wireless systems. These applications allow 
variable-rate transmission and are tolerant of high rate os- 
cillations. Therefore, opportunistic communications [1] have 
been introduced to achieve higher system throughput. The 
concept of opportunistic communications is essentially to 
transmit more information in good channel states and less 
in poor ones. Hard real-time service and best-effort service 
may be viewed as two extremes of rate-oscillation sensitivity. 
However, services such as many audio and video applications 
generally expect a balance between average rate and rate 
oscillation. If constant-rate transmission algorithms are used, 
the transmission efficiency would be very low. On the other 
hand, opportunistic scheduling schemes, such as [2] and [3], 
whose objective is to maximize a utility of average rates, 
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can improve efficiency in terms of average rate but result in 
high oscillation in instantaneous transmission rates. This thus 
motivated the need for a new criterion that can be used to 
facilitate the choice of the combinations of average rate and 
rate oscillation. 

In this letter we propose a new network objective function, 
namely, Time-average Aggregate concave Utility of instanta- 
neous transmission Rate (TAUR). To illustrate the underlying 
mechanism of the proposed objective function, let us consider 
transmitting a same data stream using two different schemes. 
For scheme one, the data stream is transmitted at a constant 
speed of 1 Mbit/s during the interval of 10 seconds. For 
scheme two, no data is transmitted in the first 9 seconds and 10 
Mbit/s is used for transmission in the last second. Obviously, 
the utilities of the two transmission schemes are identical if 
the utility is defined as a function of average transmission 
rate. However, the time-average concave utility as a function 
of instantaneous transmission rate for scheme one is higher 
than that for scheme two, which is expected if the degree of 
user satisfaction is concerned. Thus, the resource allocation 
based on TAUR should be able to balance the average rate 
and rate oscillation over time by adjusting the concavity of 
the utility function. 

The TAUR-based resource allocation problems are studied 
in a multi-user wireless system in an adaptive time-division 
fashion. We first consider the optimal time sharing (TS)-based 
scheduling policy in a backlogged system with constant power 
allocation. For a strictly concave utility, our analysis shows 
that the TS policy allows users with relatively better channel 
conditions to share a same time frame. We then propose a 
joint optimal time sharing and power control (JTPC) strategy 
where both the time-sharing fraction and the transmit power 
can be varied over time. In addition, a quantized TS policy 
with limited channel feedback (QTSL) is proposed for the 
ease of practical implementation. 

II. System Model 

We consider a single cell consisting of ./V mobile users 
communicating with a common base station. The communica- 
tion link between each user and the base station is modelled 
as a slowly time-varying fading channel with additive white 
Gaussian noise (AWGN). The channel coefficients remain 
approximately unchanged during each time frame, but can 
vary from one frame to another. Let the instantaneous channel 
gain of user i at any given time frame t be denoted by 
gi(t). The network channel gain is denoted by the TV-tuple 
§W - (ffi(i),52(i), ■ ■ -,gN(t)), and has a joint probability 



density function (PDF) /(g). Let Pi(t) denote the transmit 
power allocated to or from user i. The achievable transmission 
rate of user i in the absence of other users can be expressed 
as [4] 

Pi{t)9i(t) 



Ci{t) = log 2 



1 + 



I3N 



(1) 



where Nq is the noise power, and (3 is the signal-to-noise 
ratio (SNR) gap [4]. We assume that each time frame can 
be accessed by all the N users in an adaptive time-sharing 
fashion. Let p(g) = (px, p%, ■ ■ . , Pn) denote the time-sharing 
adaptation policy with respect to the network channel gain g, 
where pi represents the fraction of the frame duration allocated 
to user i. Without loss of generality, the interval of a time 
frame is normalized. The actual transmission rate of user i at 
the t-th time frame, r, (<), is given by rj(t) = pjCj(t). The 
frame index t in ri(t) and gi{t) may be omitted hereafter if 
no confusion occurs. 

The utility considered here is a function of the instantaneous 
transmission rate. For user i, we denote its utility as Ui(ri(t)). 
The exact expression for the utility £/•»(•) is not crucial. The 
analysis throughout this paper is valid for any utility function 
that is increasing, differentiable and concave. 

III. Time Sharing 

In this section, we assume that the transmission powers be- 
tween BS and mobiles are constant and identical for different 
users, i.e., Pi(t) = p,\/i,t, and that the wireless network is 
fully loaded. We choose the aggregate utility, which is the 
sum of individual user utilities, as the performance measure. 
The goal is to find the optimal time-sharing adaptation policy 
p* (g) relative to the instantaneous network channel condition 
g, so as to maximize the TAUR of the system. Since the 
channel processes are ergodic, the optimization problem can 
be expressed mathematically as 



max Its — 
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where notation E t [•] represents the time average. 

Since the constraint <f3j is defined for all channel states, 
the average aggregate utility maximization in (f2]i is equiv- 
alent to maximizing the instantaneous aggregate utility for 
every channel state. Furthermore, since the utility Ui(-) is a 
concave function of r*j by assumption, Ui(-) is also concave 
in pi. Therefore, taking the derivative of the Lagrangian 
Y,f=i U i(Pi(s),9i) + A[l - £ili /°i(g)]> and equating it to 
zero, we obtain p* as 



pKs) 



dpi 



(A) 



i = l,2,...,N. (4) 



In ©, (dUt/dpi) 1 (•) is the inverse function of 
{dUi/dpi)^^ and (x) + = max(0,a;). The Lagrange 
multiplier A can be determined using the constraint (01. As 
can be seen, the optimal time-sharing policy is only a function 
of the instantaneous channel conditions and is independent of 
the channel statistics. The explicit solution of the proposed 
optimal TS policy in a two-user network with log utility is 
discussed in [5]. 

We now compare the proposed TS policy for maximizing 
the time-average aggregate utility of instantaneous rate with 
the existing gradient scheduling (GS) policy [3] for max- 
imizing the aggregate utility of average transmission rate. 
As shown in [3], the GS policy maximizes the weighted 
sum of instantaneous rates in the system. In a time-shared 
wireless network, this results in choosing the user satisfying 
the following condition to transmit during the whole time 
frame: 

dUiiRiit)) 



i*(t) 



arg max 

Ki<N 



-Ci[t). 



(5) 



BRi(t) 

Here, Ri(t) is updated as R t (t) = (l-a)Ri(t-l)+aCi(t-l) 
for i = i*(t - 1) and = (1 - a)R t (t - 1) for Mi ^ 

i*(t — 1), with arbitrary initial value i?i(0), and a > is 
a fixed small parameter. On the other hand, in the proposed 
TS policy, it shall be clear in Section V that the decreasing 
marginal utility gives opportunity to the users in poor channel 
condition to share the time frame. 

Fig. Q] shows the simulated TAUR in the network by using 
the proposed TS policy and the existing GS policy. Here the 
utility of both policies is specified to have the form 



U(r) = In (l + j) , 



(6) 



where A > is a concavity indicator. The channels are 
assumed to be Rayleigh fading and the SNR gap is set to 8.2 
dB for all users which corresponds to a bit-error-rate require- 
ment of 10~ 5 when adaptive quadrature amplitude modulation 
(QAM) is used. The number of users in the network varies 
from 8 to 32 and their channel conditions are symmetric. The 
concavity indicator A is set at 0.1. Fig. |2] compares the mean 
and the standard deviation of the transmission rate achieved 
by TS and GS policies for A = 0.1, 1 and 10 with N = 32. 
The performances of the GS policy at different values of A 
are identical due to the channel symmetry among the users. In 
fact, it is an extreme case of our TS policy when A — > oo 
. Although GS can obtain the maximum average rate, the 
standard deviation of the rate increases rapidly as the average 
SNR increases. The TS policy, on the other hand, has a flexible 
balance between the average rate and the rate oscillation 
through adjusting the concavity of the utility function. 

IV. JOINT TIME-SHARING AND POWER CONTROL 

In this section we allow both the transmission time and 
power to change with respect to channel conditions in each 
time frame. The optimization problem in Section [HI] is ex- 
tended to finding the joint optimal time-sharing and power 

'When the utility function is strictly concave, (dUi/dpi){-) is a monoton- 
ically decreasing function of p and, hence, its inverse exists. 



control policy. Uplink and downlink transmission are con- 
sidered separately due to different power constraints. For the 
uplink, the power source is, generally, rechargeable batteries 
attached to the mobile devices. Thus, the optimization is sub- 
ject to each user's average power constraint. Mathematically, 
this can be represented as 

r N 

max ijTPc = / z2Ui(pi(g),Pi(s),9i)f(s)dS ( 7 ) 

(P,p) Jzt! 
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/ Pi(g)Pi(g)/(g)dg =Pi, i = 1,2, . 
J s 
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,N, (9) 



where pi is the average power constraint of user i. 

Note that the utility function U(p,p,g) is concave in p and 
p separately based on our assumption, but not in both p and 
p. Moreover, the equality constraints in (0 are nonlinear. To 
make the problem more tractable, we define s = pp. It can 
be shown that U(p,s,g) is concave in both p and s (since 
its Hessian matrix is negative semidefinite). The problem thus 
falls into the classic calculus of variations [6]. Applying the 
Euler-Lagrange equation results in the following necessary and 
sufficient conditions for the optimal solution p* and s*: 

dll 

, '+A i = 0, i=l,2,...,JV (10) 



dsi 
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dpi 



/(g) + A (g) =0, 



1,2,. ..,7V (11) 



where Ao(g) and Aj are Lagrange multipliers and determined 
by constraints ([8]l and (|9). The closed-form solutions to the 
above equations are generally difficult to obtain due to nonlin- 
earity of the utility function U in p and s. The nonlinear Gauss- 
Seidel algorithm [7] can be used to search for the optimal time- 
sharing vector p* and vector s* = [s*, 8%, ■ ■ ■ , s* N ] under the 
average power constraint p = \pi,p2, ■ ■ . ,Pn]- It is outlined 
as follows. 

1) Initialize s: set p (0) = Np, pi = 1/N, Vi, let j = and 
calculate the initial s' '. 

2) Update p&' : given find the optimal time-sharing vector 

rXi) 



ig CED 
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dUj 
dpi 



i = 1,2, 



,N, 



where \' satisfies l[8j 

3) Compute the average aggregate utility using @ 

4) Update s 



O+i). 



given p , find the optimal power control 



vector s 



using 
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for i = 1,2, 



where Xi satisfies $9$ for i = 1, 2, . . . , N and let j = j + 1. 
5) Repeat Steps 2)-4) until /O) < A, where A is a small 

number. 

At each iteration, the optimization of p and s is carried 
out successively. Steps 2) and 4) involve only the calculation 
of a one-dimensional maximization problem whose solution 
is given in Section [HI] The condition that U is continuously 



differentiable and concave in (p, s) guarantees the convergence 
of the nonlinear Gauss-Seidel algorithm. The proof can be seen 
in [7, Prop 3.9 in Section 3.3]. 

The problem formulation for the downlink differs from 
the urjlink only in the power constraint, which is given by 
J g E l= iP«(g)K(g)/(g) d g = P- A similar problem-solving 
approach to the one proposed for the uplink can also be 
obtained and hence is omitted. 

Although JTPC utilizes two degrees of freedom in resource 
allocation and has much higher computational complexity, its 
performance is not expected much higher than that of TS in 
the high SNR region. This is attributed to the fact that the 
transmission rate is linear in time, but concave in transmission 
power. That is, at high SNR, the gain from power control 
is smaller than from time sharing adaptation. This is also 
verified by the simulation results shown in Fig. [3] where 
we compare the average aggregate utilities obtained by the 
proposed TS and JTPC schemes. It is observed that at high 
SNR the performance gain of JTPC over TS is not noticeable. 
Hence, in Section [V] we assume the absence of power control. 

V. Implementation Issues 

We have provided the analytical results for TS and JTPC 
policies. In this section, we address some important imple- 
mentation issues, including quantized time sharing fractions, 
limited channel feedback, and fairness. 

A. Quantized Time Sharing With Limited Channel Feedback 

In scheduling the downlink transmission, the BS needs to 
know each user's channel state information (CSI). This could 
be gained by sending the CSI from each user to the base 
station through a feedback channel upon channel estimation 
at each user terminal. In practice, perfect channel feedback 
is not feasible due to limited capacity of the feedback links. 
We assume in this subsection that the channel estimate of each 
user is quantized into K = 2 M regions using M bits. Let G = 
{Gi, G2, . . • , Gk+i] be the set of channel gain thresholds in 
increasing order with G\ = and Gk+i = 00. If the channel 
gain of user i falls into range [Gk, Gk+i), we say user i is in 
channel state k, and denote it as Si = k. Suppose we apply the 
equal-probability method to do the channel partitioning and the 
channel gains follow exponential distribution, the threshold set 
<G can be determined easily. 

Furthermore, time sharing fractions in practice cannot be an 
arbitrary number, but are restricted to a finite set of values due 
to switching latency and difficulties in rigid synchronization. 
Therefore, we assume that a time frame is partitioned into L 
slots with equal length. Correspondingly, the number of users 
which can transmit in the same frame is limited by L. At 
the beginning of each frame, the base station computes the 
optimal time-sharing vector p* defined in ( fl2l upon obtaining 
the network channel states S = [Si, S2, ■ ■ ■ , Sn]- 

The time sharing policy considered here maps the current 
channel states S € K^X to a time-sharing vector To avoid 
the exponential complexity in exhaustive search, an online 
greedy algorithm with complexity of O(LN) is proposed. 
Beginning with an initial solution p = [0, 0, . . . , 0], each time 
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slot is assigned at one iteration to the most favorable user that 
maximizes the increment of the current objective till the total 
L slots are traversed. The greedy algorithm is outlined below: 
1) Initialization 



Let v = (the index of the time slot), 

o (Vie {i,...,N}). 



and U. 



((i) 



2) Allocate the (v + l) th 

+ 1/X and +1) 



time slot to the user indexed by i 



arg 



max 
;e{i,...,w> 



3) 



Let p. 
Let v 



(v + l) 



Pi for i 7^ i* . 
L 



(13) 



v + l, and return to Step 2) until v 
It is shown in the appendix that this algorithm leads to the 
optimal solution to Problem (fT~2t . 

Fig. |4] shows the performance of the quantized time sharing 
with limited channel feedback (QTSL) policy. The CSI is 
quantized at 3-bit resolution, and the number of time slots 
is the same as the number of users in the network. It is seen 
that the performance of QTSL approaches that obtained by 
the optimal TS policy. In Fig. [5] we illustrate the performance 
of the QTSL policy when the number of time slots in a time 
frame is half of the number of users in the network. This time 
we also vary the number of channel feedback bits from 1 to 
3. There is a performance gap between QTSL and TS, but the 
average aggregate utility obtained by QTSL with TV = 32 and 
L = 16 is still higher than that of the optimal TS policy with 
N = 16. In addition, the performance gain is limited when 
the CSI is quantized using more than two bits. 

B. Fairness Guarantee 

The scheduling schemes based on the aggregate utility max- 
imization developed in previous sections may not guarantee 
fairness for users with different channel statistics, such as path 
loss and shadowing. This is because the TS scheme tends to 
allocate more time to the user with higher average SNR. A 
commonly used fairness criterion in computer networks is the 
max-min fairness [8]. Unlike wireline networks, the wireless 
network suffers from time-varying channel impairments. Thus, 
time-average utility max-min (TUMM) fairness, defined be- 
low, is meaningful in wireless networks. 

Definition 1: A time sharing policy p is time-average utility 
max-min fair if for each i S {1,2, ...,7V} and any other 
time sharing policy p for which E[Ui(pi)] < E[£/i(pi)], there 
exists some j with E[Ui(pi)] > E[Uj(pj)] > E[Uj(pj)], i.e., 
increasing some component E(Ui(pi)) must be at the expense 
of decreasing some already smaller component E[Uj{p~j)] 

Since the resource allocation based on TUMM fairness does 
not concern the fairness at any instant, it allows the sched- 
uler to exploit the instantaneous fluctuation of the channel 



conditions. We formulate the TUMM-based time sharing as a 
multiple-objective programming problem: 



max 
Pis) 

s.t. 



1,2, 



,7V 



(14) 
(15) 



E[U i (p i (g),g i )}=a, 

N 

$>(g) = i, 

i=l 

where a is a variable to be maximized. When t/j(-)'s are all 
strictly concave, there exists a unique solution. 

A frequently used method in multiple objective pro- 
gramming problem is the point estimate weighted-sum ap- 
proach [9]. In this method, each objective is multiplied 
by a weight Wi. Then, the N weighted objectives are 
summed to form a weighted-sum objective function, denoted 
as YliLi w i^[Ui(pi)]- The weights Wi's are chosen such 
that (TT~5T > is satisfied. It can be proven that the solution to 
maximize the average aggregate weighted utility is also Pareto- 
optimal. Take the two-user case for example. The notion of 
fairness can be realized by dynamically adjusting the weights, 
i.e., dynamically adjusting the moving direction towards the 
intersection of the Pareto-optimal frontier and the straight line 
E[Ui] = EfUjj] while keeping the two users' average utility 
on the Pareto-optimal frontier. This weight adaptation method 
can be used to guarantee fairness when the users have different 
channel distributions. 

VI. Conclusion 

We develop a new framework for resource allocation in 
wireless networks for variable-rate transmission. The time- 
average aggregate utility of instantaneous transmission rate is 
proposed to jointly optimize the resulting average rate and 
rate oscillation. In particular, a time-sharing policy and a joint 
time-sharing and power control policy are designed to exploit 
the channel fluctuation. The effects of partial channel state in- 
formation and discrete time sharing fractions are also studied. 
Furthermore, an adaptive method to guarantee strict fairness 
among users with different channel statistics is discussed. 

Appendix 

Optimality Proof of The Greedy Algorithm 

Consider one realization of S and pi 6 {0, 1/L, 2/L, . . . , 1}. 

For simplicity, we define U l {p l ) := J G g i+1 Ui(pi(g), gi)dgi and 

di(p) — Ui(p) — Ui(p — 1/L). Let Dl denote the set of L largest 
elements in D = {di(p)\i = 1,2, ... ,N,p = 0, 1/L, 2/L, . . . 1}. 
Then, the Lagrangian can be written as: 

N N 

^(Uiip^-Xpi) = J2[Ui(Q)+(di(l/L)-X)+. . .+(di(Pi)-X)]- 

i=l i=l 

(16) 



Let A be the smallest di(p) in Dl, then 
di(p) - A 

Therefore the Lagrangian d 1 6b is maximized by 



>0 ifdi(p)eD L 
< otherwise 



[ ifdi{l/L)<£D L 
p\ = t 1 ifdi(l)€D L 

p if di(p) € D L and d^p + l/L) £ D h . 



(17) 



(18) 



Due to the concavity of the utility function, we have di(l/L) > 
di(2/L) > ... > di(Ni/L) > di((Ni + l)/L) > ... > di(l). 
Here suppose that d z (Ni/L) £ Dl and di((Nj + l)/L) $ D L , then 
J2i Ni = \D L \. Since p* = N t /L from {H, £. = |L>z,|/£ = 1 
holds for the chosen A. p* is an optimal solution of Problem < l 12b . 

Since Step 2) computes the L largest di(p) in the decreasing order 
of their values, the greedy algorithm obtains the optimal solution. 



References 

[1] P. Viswanath, D. Tse, and R. Laroia, "Opportunistic beamforming using 
dumb antennas," IEEE Trans, on Info. Tlieory, vol. 48, no. 6, pp. 1277— 
1294, June 2002. 

[2] A. Jalali, R. Padovani, and R. Pankaj, "Data throughput of cdma-hdr a 
high efficiency-high data rate personal communication wireless system," 
in Proc. Veh. Technol. (VTC), 2000. 

[3] A. L. Stolyar, "On the asymptotic optimality of the gradient schedul- 
ing algorithm for multiuser throughput allocation," Operation Research, 
vol. 53, no. 1, pp. 12-25, Jan. 2005. 

[4] X. Qiu and K. Chawla, "On the performance of adaptive modulation in 
cellular systems," IEEE Trans, on Commun., vol. 47, no. 6, pp. 884-895, 
June 1999. 

[5] X. Zhang, M. Tao, and C. S. Ng, "Time sharing policy in wireless 
networks for variable rate transmission," in Proc. IEEE ICC 2007, June 
2007. 

[6] B. S. Gottfried and J. Weisman, Introduction to optimization theory. 

Prentice-Hall, 1973. 
[7] D. P. Bertsekas and J. N. Tsitsiklis, Parallel and Distributed Computation: 

Numerical Methods. Athena Scientific, 1997. 
[8] J. M. Jaffe, "Bottleneck flow control," IEEE Trans. Commun., vol. 29, 

no. 7, pp. 954-962, July 1981. 
[9] R. E. Steuer, Multiple Criteria Optimization: Theory, Computation, and 

Application. Wiley, New York, 1986. 






TS. N=8 
TS, N.1 6 
TS, N.32 


— e— 

— A— 






GS, N-8 
GS, N.16 
GS, N=32 







Average SNR(dB) 



Fig. 1 . Performance comparison of different resource allocation schemes 
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Fig. 2. Average rate and standard deviation of rate versus average SNR 
using TS and GS 
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Fig. 3. Average aggregate utility versus average SNR using TS and JTPA 
for two-user scenario 




Fig. 5. Performance comparison of TV = 32 users with L = 16 and N = 16 
users with L = 8 time slots, and with M = 1 , 2 and 3 feedback bits 



